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SITE-DIRECTED MUTAGENESIS OF 
ESCHERICHIA COLI PHYTASE 

The present application claims the benefit of U.S. Provisional 
5 Patent Application Serial No. 60/166,179, filed November 18, 1999. 

FIELD OF THE INVENTION 

The present invention is directed to the site-directed mutagenesis of 
Escherichia coli phosphatase/phytase. 

10 

BACKGROUND OF THE INVENTION 

Phytases, a specific group of monoester phosphatases, are required 
to initiate the release of phosphate ("P") from phytate (myo-inositol 
hexophosphate), the major storage form of P in cereal foods or feeds (Reddy, N.R. 

1 5 et ah, "Phytates in Legumes and Cereals," Advances in Food Research . 28: 1 
(1982)). Because simple-stomached animals like swine and poultry as well as 
humans have little phytase activity in their gastrointestinal tracts, nearly all of the 
ingested phytate P is indigestible. This results in the need for supplementation of 
inorganic P, an expensive and non-renewable nutrient, in diets for these animals. 

20 More undesirably, the unutilized phytate-P excreted through manure of these 
animals becomes P pollution of the environment (Cromwell, G.L. et al., "P- A 
Key Essential Nutrient, Yet a Possible Major Pollutant « Its Central Role in 
Animal Nutrition," Biotechnology In the Feed Industry : Proceedings Alltech 7th 
Annual Symposium, p. 133 (1991)). Furthermore, phytate chelates with essential 

25 trace elements like zinc and produces nutrient deficiencies such as growth and 
mental retardation in children ingesting mainly plant origin foods without removal 
of phytate. 

Two phytases, phyA and phyB, from Aspergillus niger NRRL3135 
have been cloned and sequenced (Ehrlich, K.C. et al., "Identification and Cloning 
30 of a Second Phytase Gene (phys) from Aspergillus niger (ficuum)" Biochem. 

Biophvs. Res. Commun.. 195:53-57 (1993); Piddington, C.S. et al., "The Cloning 
and Sequencing of the Genes Encoding Phytase (phy) and pH 2.5-optimum Acid 
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Phosphatase (aph) from Aspergillus niger var. awamori" Gene , 1 33:56-62 
(1993)). Recently, new phytase genes have been isolated from Aspergillus terreus 
and Myceliophthora thermophila (Mitchell et al., "The Phytase Subfamily of 
Histidine Acid Phosphatases: Isolation of Genes for Two Novel Phytases From 
5 the Fungi Aspergillus terreus and Myceliophthora thermophila'" Microbiology 
143:245-52, (1997)), Aspergillus fumigatus (Pasamontes et al., "Gene Cloning, 
Purification, and Characterization of a Heat-Stable Phytase from the Fungus 
Aspergillus fumigatus" Appl. Environ. Microbiol . 63:1696-700 (1997)), 
Emericella nidulans and Talaromyces thermophilus (Pasamontes et al., "Cloning 
10 of the Phytase from Emericella nidulans and the Thermophilic Fungus 

Talaromyces thermophilus" Biochim. Biophvs. Acta. . 1353:217-23 (1997)), and 
maize (Maugenest et al., "Cloning and Characterization of a cDNA Encoding a 
Maize Seedling Phytase " Biochem. J. 322:51 1-17 (1997)). 

Various types of phytase enzymes have been isolated and/or 
1 5 purified from Enterobacter sp.4 (Yoon et al., "Isolation and Identification of 
Phytase-Producing Bacterium, Enterobacter sp. 4, and Enzymatic Properties of 
Phytase Enzyme.." Enzyme and Microbial Technology 18:449-54 (1996)), 
Klebsiella terrigena (Greiner et al., "Purification and Characterization of a 
Phytase from Klebsiella terrigena" Arch. Biochem. Biophvs. 341 :20 1-06 (1997)), 
20 and Bacillus sp. DS11 (Kim et al., "Purification and Properties of a Thermostable 
Phytase from Bacillus sp. DS11" Enzyme and Microbial Technology 22:2-7 
(1998)). Properties of these enzyme have been studied. In addition, the crystal 
structure of phyA from Aspergillus ficuum has been reported (Kostrewa et ah, 
"Crystal Structure of Phytase from Aspergillus ficuum at 2.5 A Resolution ," 
25 Nature Structure Biology 4:185-90 (1997)). 1 

Hartingsveldt et al. introduced phyA gene into A. niger and 
obtained a ten-fold increase of phytase activity compared to the wild type. 
("Cloning, Characterization and Overexpression of the Phytase-Encoding Gene 
(phyA) of Aspergillus Niger" Gene 127:87-94 (1993)). Supplemental microbial 
30 phytase of this source in the diets for pigs and poultry has been shown to be 
effective in improving utilization of phytate-P and zinc (Simons et al., 
"Improvement of Phosphorus Availability By Microbial Phytase in Broilers and 
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Pigs," Br. J. Nutr. . 64:525 (1990); Lei, X.G. et al., "Supplementing Corn-Soybean 
Meal Diets With Microbial Phytase Linearly Improves Phytate P Utilization by 
Weaning Pigs," J. Anim. ScL 71 :3359 (1993); Lei, X.G. et al., "Supplementing 
Corn-Soybean Meal Diets With Microbial Phytase Maximizes Phytate P 
5 Utilization by Weaning Pigs," J. Anim. Sci. , 71:3368 (1993); Cromwell, G.L. et 
al., "P- A Key Essential Nutrient, Yet a Possible Major Pollutant — Its Central 
Role in Animal Nutrition," Biotechnology In the Feed Industry ; Proceedings 
Alltech 7th Annual Symposium, p. 133 (1991)). However, the cost of the limited 
commercial phytase supply and its instability when subjected to heat during feed 
10 pelleting preclude its practical use in animal industry (Jongbloed, A.W. et al., 

"Effect of Pelleting Mixed Feeds on Phytase Activity and Apparent Absorbability 
of Phosphorus and Calcium in Pigs," Animal Feed Science and Technology , 
28:233-42 (1990)). Moreover, phytase produced from A. niger is presumably not 
the safest source for human food manufacturing. 

] 5 Thus, there is a need to improve phytase production for application 

by the food and feed industry. 



SUMMARY OF THE INVENTION 

The present invention relates to an isolated mutant acid 
20 phosphatase/phytase which is produced by making a plurality of amino acid 

substitutions in a wild-type Escherichia coli acid phosphatase/phytase having an 
amino acid sequence of SEQ. ID. No. 1 . These amino acid substitutions are made 
at positions 200, 207, and 21 1 of SEQ. ID. No. 1 . The present invention also 
involves an isolated mutant acid phosphatase/phytase which differs from the wild- 
25 type acid phosphatase/phytase having an amino acid sequence of SEQ. ID. No. 1 
by at least one amino acid substitution which disrupts disulfide bond formation 
between Cys amino acid residues at positions 200 and 210. The mutant acid 
phosphatase/phytase of the present invention is useful in animal feed 
compositions. 

30 The present invention also relates to a method for improving the 

enzymatic properties of a wild-type Escherichia coli acid phosphatase/phytase 
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having an amino acid sequence of SEQ. ID. No. 1 . This method involves altering 
the amino acid sequence of the wild-type acid phosphatase/phytase by introducing 
amino acid substitutions into SEQ. ID. No. 1 at positions 200, 207, and 21 1 . 
Another embodiment of this method involves altering the amino acid sequence of 
5 the wild-type acid phosphatase/phytase having SEQ. ID. No. 1 by introducing at 
least one amino acid substitution which disrupts disulfide bond formation between 
Cys amino acid residues at positions 200 and 210. 

Another aspect of this invention relates to an isolated DNA 
molecule which encodes the mutant acid phosphatase/phytase of the present 
1 0 invention. Also disclosed are recombinant DNA expression systems and host 
cells containing the DNA molecule of the present invention. These constructions 
can be used to recombinantly produce the mutant acid phosphatase/phytase of the 
present invention. 

The invention also provides a basic molecular method that can be 
15 broadly applied to design mutant acid phosphatases/phytases derived from various 
source organisms, resulting in mutants with enhanced enzymatic properties such 
as greater thermostability and catalytic efficiency. This method includes 
identifying and isolating a gene of a wild-type enzyme and using this gene as the 
object of site^directed mutagenesis in order to enhance the enzyme's function 
20 and/or stability. One aspect of this invention is to use site-directed mutagenesis to 
make targeted mutations to the wild-type gene in order to add N-glycosylation 
sites to the wild-type enzyme and/or to alter the enzyme's physiochemical 
properties (e.g., increasing the net positive charge of the enzyme). In addition, 
targeted mutations can be made to the wild-type gene in order to eliminate certain 
25 disulfide bonds found in the final protein product, resulting in enhanced 
thermostability and catalytic function. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 shows the nucleotide (SEQ. ID. No. 2) and the deduced 
30 amino acid (SEQ. ID. No. 1) sequence of the E. coli acid phosphatase/phytase 
(appA). Primers are underlined and indicated by arrows. The GH loop region 
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(202-21 1) is in bold and C200 (in G helix) and C210 (in GH loop) form the 
unique disulfide bond in the a-domain. Substituted amino acids (A 13 1, V134N, 
C200, D207, and S21 1 ) are underlined and in bold. 

Figure 2 shows an SDS-gel electrophoresis (15%) analysis of 
5 purified recombinant proteins expressed in Pichia pastoris. Thirty micrograms of 
protein was loaded per lane. Lane M, prestained marker (Biorad, kDa) 
(phosphorylase ft, 103; bovine serum albumin, 76; ovalbumin, 49; carbonic 
anhydrase, 33.2; soybean trypsin inhibitor, 28); Lane 1, Endo H/(endoglycosidase 
Hy); Lane 2, r-AppA (recombinant protein produced by appA in Pichia pastor is)\ 
10 Lane 3, r-AppA + Endo H/, Lane 4, Mutant U; Lane 5, Mutant U + Endo H/, Lane 
6, Mutant R; Lane 7, Mutant R + Endo H/, Lane 8, Mutant Y; Lane 9, Mutant Y + 
Endo H/. 

Figure 3 shows the pH dependence of the enzymatic activity at 
37°C of the purified r-AppA (•) and Mutants (U, ■ ; Y, R, ♦) using sodium 
1 5 phytate as a substrate. The maximal activity for each mutant and r-AppA was 
defined as 100%. Buffers: pH 1.5-3.5, 0.2 M glycine-HCl; pH 4.5-7.5, 0.2 M 
sodium citrate; pH 8.5-1 1, 0.2 M Tris-HCL Asterisks indicate significant 
differences {P < 0.05) between r-AppA and other mutants. Results are expressed 
as the mean ± SE from three experiments. 

20 Figure 4 shows the residual enzymatic activity of the purified 

r-AppA (•) and Mutants (U, ■ ; Y, ▲; R, ♦) after exposure for 1 5 min at the 
indicated temperature. The purified enzyme was incubated for 15 min in 0.2 M 
glycine-HCl, pH 2.5. At the end of heating, the reaction mixture was cooled on 
ice for 30 min. The initial activity with sodium phytate for each recombinant 

25 enzyme was defined as 1 00%. Asterisks indicate significant differences (P < 

0.05) between r-AppA and other mutants. Results are expressed as the mean ± SE 
from three experiments. 
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DET AILED DESCRIPTION OF THE INVENTION 

The present invention relates to an isolated mutant acid 
phosphatase/phytase which is produced by site-directed mutagenesis of a wild- 
type Escherichia coli acid phosphatase/phytase. According to one embodiment, 
5 the mutant acid phosphatase/phytase is made by introducing a plurality of targeted 
amino acid substitutions in a wild-type Escherichia coli acid phosphatase/phytase. 
In another embodiment, the mutant acid phosphatase/phytase is produced by 
introducing at least one amino acid substitution into the wild-type acid 
phosphatase/phytase in order to disrupt disulfide bond formation between Cys 
10 amino acid residues of the mutant acid phosphatase/phytase. The wild-type acid 
phosphatase/phytase has an amino acid sequence corresponding to SEQ. ID. No. 1 
as follows: 



15 



30 



45 



Met Lys Ala lie Leu lie Pro Phe Leu Ser Leu Leu lie Pro Leu Thr 
15 10 15 

Pro Gin Ser Ala Phe Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser 

20 25 30 



Val Val lie Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr 

20 35 40 45 

Gin Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val 

50 55 60 

25 Lys Leu Gly Trp Leu Thr Pro Arg Gly Gly Glu Leu lie Ala Tyr Leu 

65 70 75 80 



Gly His Tyr Gin Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Ala Lys 

85 90 95 

Lys Gly Cys Pro Gin Pro Gly Gin Val Ala lie lie Ala Asp Val Asp 

100 105 110 



Glu Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro 
35 115 120 125 

Asp Cys Ala lie Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp 
130 135 140 

40 Pro Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala 

145 150 155 160 



Asn Val Thr Asp Ala lie Leu Ser Arg Ala Gly Gly Ser lie Ala Asp 

165 170 175 

Phe Thr Gly His Arg Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu 

180 185 190 
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Asn Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu 
195 200 205 

Ser Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala 
5 210 215 220 

Asp Asn Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr 
225 230 235 240 

10 Glu lie Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp 

245 250 255 



15 



30 



45 



50 



Gly Arg lie Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His 

260 265 270 

Asn Ala Gin Phe Tyr Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser 
275 280 285 



Arg Ala Thr Pro Leu Leu Asp Leu lie Lys Thr Ala Leu Thr Pro His 
20 290 295 300 

Pro Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu 

305 310 315 320 

25 Phe lie Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu 

325 330 335 



Glu Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly 

340 345 350 

Gly Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin 
355 360 365 



Trp He Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp 
35 370 375 380 

Lys Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr 
385 390 395 400 

40 Leu Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala 

405 410 415 



Gly Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu * 

420 425 430 



The wild-type acid phosphatase/phytase having the amino acid 
sequence according to SEQ. ID. No. 1 is encoded by the coding sequence of bases 
187-1486 of the nucleotide sequence of SEQ. ID. No. 2 as follows: 

1 taa gga gca gaa aca ATG TGG TAT TTA CTT TGG TTC GTC GGC ATT 

46 TTG TTG ATG TGT TCG CTC TCC ACC CTT GTG TTG GTA TGG CTG GAC 

91 CCG CGA TTG AAA AGT T aac gaa cgt agg cct gat gcg gcg cat 

55 134 tag cat cgc ate agg caa tea ata atg tea gat atg aaa age gga 
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15 



25 



179 aac ata teg ATG AAA GCG ATC TTA ATC CCA TTT TTA TCT CTT CTG 

224 ATT CCG TTA ACC CCG CAA TCT GCA TTC GCT CAG AGT GAG CCG GAG 

5 

269 CTG AAG CTG GAA AGT GTG GTG ATT GTC AGC CGT CAT GGT GTG CGT 

314 GCC CCA ACC AAG GCC ACG CAA CTG ATG CAG GAT GTC ACC CCA GAC 

10 359 GCA TGG CCA ACC TGG CCG GTA AAA CTG GGT TGG CTG AC A CCA CGC 

404 GGT GGT GAG CTA ATC GCC TAT CTC GGA CAT TAC CAA CGC CAG CGT 

449 CTG GTG GCC GAC GGA TTG CTG GCG AAA AAG GGC TGC CCG CAG CCT 

494 GGT CAG GTC GCG ATT ATT GTC GAT GTC GAC GAG CGT ACC CGT AAA 

539 ACA GGC GAA GCC TTC GCC GCC GGG CTG GCA CCT GAC TGT GCA ATA 

20 584 ACC GTA CAT ACC CAG GCA GAT ACG TCC AGT CCC GAT CCG TTA TTT 

62 9 ATT CCT CTA AAA ACT GGC GTT TGC CAA CTG GAT AAC GCG AAC GTG 

674 ACT GAC GCG ATC CTC AGC AGG GCA GGA GGG TCA ATT GCT GAC TTT 

719 ACC GGG CAT CGG CAA ACG GCG TTT CGC GAA CTG GAA CGG GTG CTT 

764 AAT TTT CCG CAA TCA AAC TTG TGC CTT AAA CGT GAG AAA CAG GAC 

30 809 GAA AGC TGT TCA TTA ACG CAG GCA TTA CCA TCG GAA CTC AAG GTG 

854 AGC GCC GAC AAT GTT TCA TTA ACC GGT GCG GTA AGC CTC GCA TCA 

899 ATG CTG ACG GAA ATA TTT CTC CTG CAA CAA GCA CAG GGA ATG CCG 

944 GAG CCG GGG TGG GGA AGG ATC ACT GAT TCA CAC CAG TGG AAC ACC 

98 9 TTG CTA AGT TTG CAT AAC GCG CAA TTT TAT TTA CTA CAA CGC ACG 

40 1034 CCA GAG GTT GCC CGC AGT CGC GCC ACC CCG TTA TTG GAT TTG ATC 

1079 AAG ACA GCG TTG ACG CCC CAT CCA CCG CAA AAA CAG GCG TAT GGT 

1124 GTG ACA TTA CCC ACT TCA GTG CTG TTT ATT GCC GGA CAC GAT ACT 

1169 AAT CTG GCA AAT CTC GGC GGC GCA CTG GAG CTC AAC TGG ACG CTT 

1214 CCA GGT CAG CCG GAT AAC ACG CCG CCA GGT GGT GAA CTG GTG TTT 

50 12 59 GAA CGC TGG CGT CGG CTA AGC GAT AAC AGC CAG TGG ATT CAG GTT 

1304 TCG CTG GTC TTC CAG ACT TTA CAG CAG ATG CGT GAT AAA ACG CCG 

1349 CTA TCA TTA AAT ACG CCG CCC GGA GAG GTG AAA CTG ACC CTG GCA 

1394 GGA TGT GAA GAG CGA AAT GCG CAG GGC ATG TGT TCG TTG GCC GGT 

1439 TTT ACG CAA ATC GTG AAT GAA GCG CGC ATA CCG GCG TGC AGT TTG 

60 14 84 TAA 



35 



45 



55 
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This acid phosphatase/phytase is derived from E. coli. 

In producing the mutant acid phosphatase/phytase of the present 
invention, amino acid substitutions are made at positions 200, 207, and 21 1 of 
5 SEQ. ID. No. 1. It is particularly preferred to have the amino acid substitutions in 
the acid phosphatase/phytase of SEQ. ID. No. 1 be as follows: at position 200, be 
an Asn amino acid residue instead of a Cys amino acid residue; at position 207, be 
an Asn amino acid residue instead of an Asp amino acid residue; and at position 
21 1 , be an Asn amino acid residue instead of a Ser amino acid residue. As a 
1 0 result, the mutant acid phosphatase/phytase has an amino acid sequence of SEQ. 
ID. No. 3 as follows (the amino acid substitutions are underlined and in bold): 

Met Lys Ala lie Leu lie Pro Phe Leu Ser Leu Leu lie Pro Leu Thr 
15 10 15 

15 Pro Gin Ser Ala Phe Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser 

20 25 30 



20 



35 



Val Val lie Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr 
35 40 45 

Gin Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val 
50 55 60 



Lys Leu Gly Trp Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu 
25 65 70 75 80 

Gly His Tyr Gin Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Ala Lys 

. 85 90 95 

30 Lys Gly Cys Pro Gin Pro Gly Gin Val Ala He He Ala Asp Val Asp 

100 105 110 



Glu Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro 
115 120 125 

Asp Cys Ala He Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp 
130 135 140 



Pro Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala 
40 145 150 155 160 

Asn Val Thr Asp Ala He Leu Ser Arg Ala Gly Gly Ser lie Ala Asp 

165 170 175 

45 Phe Thr Gly His Arg Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu 

180 185 190 



50 



Asn Phe Pro Gin Ser Asn Leu Asn Leu Lys Arg Glu Lys Gin Asn Glu 
195 200 205 
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Ser Cys Asn Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala 
210 215 220 

Asp Asn Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr 
5 225 230 235 240 

Glu lie Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp 

245 250 255 

10 Gly Arg He Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His 

260 265 270 



15 



30 



Asn Ala Gin Phe Tyr Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser 
275 280 285 

Arg Ala Thr Pro Leu Leu Asp Leu He Lys Thr Ala Leu Thr Pro His 
290 295 300 



Pro Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu 
20 305 310 315 320 

Phe He Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu 

325 330 335 

25 Glu Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly 

340 345 350 



Gly Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin 

355 360 365 

Trp He Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp 

370 375 380 



Lys Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr 
35 385 390 395 400 

Leu Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala 

405 410 415 

40 Gly Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu * 

420 425 430 

The mutant acid phosphatase/phytase of SEQ. ID. No. 3 has a molecular mass of 
45 to 48 kDa, after deglycosylation, and has a specific phytase activity of 63 
45 U/mg. The mature protein is represented by the amino acid sequence of amino 
acids 21-432 of SEQ. ID. No. 3. 

Another aspect of the present invention involves producing a 
mutant acid phosphatase/phytase by inserting at least one amino acid substitution 
into the amino acid sequence of SEQ. ID. No. 1 in order to disrupt disulfide bond 
50 formation in the mutant acid phosphatase/phytase. In particular, targeted 
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substitution of the Cys amino acid residue at positions 200 and/or 210 of SEQ. ID. 
No. 1 can be made in order to eliminate the disulfide bond between these residues. 

The mutant acid phosphatase/phytase having an amino acid 
sequence according to SEQ. ID. No. 3 is encoded by the coding sequence of bases 
5 1 87-1 486 of the nucleotide sequence of SEQ. ID. No. 4 as follows (the codons for 
the substituted Asn residues at amino acid positions 200, 207, and 211 are 
underlined and in bold): 



10 



15 



20 



25 



30 



35 



40 



45 



50 



1 


taa 


gga 


gca 


gaa 


aca 


ATG 


TGG 


TAT 


TTA 


CTT 


TGG 


TTC 


GTC 


GGC 


ATT 


46 


TTG 


TTG 


ATG 


TGT 


TCG 


CTC 


TCC 


ACC 


CTT 


GTG 


TTG 


GTA 


TGG 


CTG 


GAC 


91 


CCG 


CGA 


TTG 


AAA 


AGT 


T aac gaa cgt agg cct gat gcg gcg cat 


134 


tag 


cat 


cgc 


ate 


agg 


caa 


tea 


a Ld 


aCy 


^ 


gar 


a eg 


aaa 


age 


gga 


179 


aac 


ata 


teg ATG AAA GCG ATC 


i in 


ATP 






1 in 


r Pr"r 
1L1 


L 1 1 


CTG 


224 


ATT 


CCG 


TTA 


ACC 


CCG 


CAA 


TCT 




ttp 


ul 1 


prp 






CCG 


GAG 


269 


CTG 


AAG 


CTG 


GAA 


AGT 


GTG 


GTG 


ATT 




ap.p 


PPT 


PAT 


1 




r*r* rp 
LGT 


314 


GCC 


CCA 


ACC 


AAG 


GCC 


ACG 


CAA 


CTG 


ATG 


CAG 


GAT 


GTC 


ACC 


CCA 


GAC 


359 


GCA 


TGG 


CCA 


ACC 


TGG 


CCG 


GTA 


AAA 


CTG 


GGT 


TGG 


CTG 


ACA 


CCA 


CGC 


404 


GGT 


GGT 


GAG 


CTA 


ATC 


GCC 


TAT 


CTC 


GGA 


CAT 


TAC 


CAA 


CGC 


CAG 


CGT 


449 


CTG 


GTG 


GCC 


GAC 


GGA 


TTG 


CTG 


GCG 


AAA 


AAG 


GGC 


TGC 


CCG 


CAG 


CCT 


494 


GGT 


CAG 


GTC 


GCG ATT 


ATT 


GTC 


GAT 


GTC 


GAC 


GAG 


CGT 


ACC 


CGT 


AAA 


539 


ACA 


GGC 


GAA 


GCC 


TTC 


GCC 


GCC 


GGG 


CTG 


GCA 


CCT 


GAC 


TGT 


GCA 


ATA 


584 


ACC 


GTA 


CAT 


ACC 


CAG 


GCA 


GAT 


ACG 


TCC 


AGT 


CCC 


GAT 


CCG 


TTA 


TTT 


629 


ATT 


CCT 


CTA 


AAA 


ACT 


GGC 


GTT 


TGC 


CAA 


CTG 


GAT 


AAC 


GCG 


AAC 


GTG 


674 


ACT 


GAC 


GCG 


ATC 


CTC 


AGC 


AGG 


GCA 


GGA 


GGG 


TCA 


ATT 


GCT 


GAC 


TTT 


719 


ACC 


GGG 


CAT 


CGG 


CAA 


ACG 


GCG 


TTT 


CGC 


GAA 


CTG 


GAA 


CGG 


GTG 


CTT 


764 


AAT 


TTT 


CCG 


CAA 


TCA 


AAC 


TTG 


AAC 


CTT 


AAA 


CGT 


GAG 


AAA 


CAG 


AAT 


809 


GAA 


AGC 


TGT 


AAC 


TTA 


ACG 


CAG 


GCA 


TTA 


CCA 


TCG 


GAA 


CTC 


AAG 


GTG 


854 


AGC 


GCC 


GAC 


AAT 


GTT 


TCA 


TTA 


ACC 


GGT 


GCG 


GTA 


AGC 


CTC 


GCA 


TCA 


899 


ATG 


CTG 


ACG 


GAA 


ATA 


TTT 


CTC 


CTG 


CAA 


CAA 


GCA 


CAG 


GGA 


ATG 


CCG 


944 


GAG 


CCG 


GGG 


TGG 


GGA 


AGG 


ATC 


ACT 


GAT 


TCA 


CAC 


CAG 


TGG 


AAC 


ACC 


989 


TTG 


CTA 


AGT 


TTG 


CAT 


AAC 


GCG 


CAA 


TTT 


TAT 


TTA 


CTA 


CAA 


CGC 


ACG 
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1034 CCA GAG GTT GCC CGC AGT CGC GCC ACC CCG TTA TTG GAT TTG ATC 

1079 AAG ACA GCG TTG ACG CCC CAT CCA CCG CAA AAA CAG GCG TAT GGT 

5 

1124 GTG ACA TTA CCC ACT TCA GTG CTG TTT ATT GCC GGA CAC GAT ACT 

1169 AAT CTG GCA AAT CTC GGC GGC GCA CTG GAG CTC AAC TGG ACG CTT 

10 1214 CCA GGT CAG CCG GAT AAC ACG CCG CCA GGT GGT GAA CTG GTG TTT 

12 59 GAA CGC TGG CGT CGG CTA AGC GAT AAC AGC CAG TGG ATT CAG GTT 

1304 TCG CTG GTC TTC CAG ACT TTA CAG CAG ATG CGT GAT AAA ACG CCG 

15 

1349 CTA TCA TTA AAT ACG CCG CCC GGA GAG GTG AAA CTG ACC CTG GCA 

1394 GGA TGT GAA GAG CGA AAT GCG CAG GGC ATG TGT TCG TTG GCC GGT 

20 143 9 TTT ACG CAA ATC GTG AAT GAA GCG CGC ATA CCG GCG TGC AGT TTG 

1484 TAA 



One embodiment of the present invention involves the insertion of 
25 the mutant acid phosphatase/phytase gene into an expression vector system, using 
recombinant DNA technology well known in the art. This enables one to express 
this gene in a host cell, allowing for the production and purification of the acid 
phosphatase/phytase for use in compositions, such as for animal feed. 

The DNA of the mutant acid phosphatase/phytase gene can be 
30 isolated and/or identified using DNA hybridization techniques. Nucleic acid 

(DNA or RNA) probes of the present invention will hybridize to a complementary 
nucleic acid under stringent conditions. Less stringent conditions may also be 
selected. Generally, stringent conditions are selected to be about 50°C lower than 
the thermal melting point (T m ) for the specific sequence at a defined ionic strength 
35 and pH. The T m is the temperature (under defined ionic strength and pH) at which 
50% of the target sequence hybridizes to a perfectly matched probe. The T m is 
dependent upon the solution conditions and the base composition of the probe, 
and for DNArRNA hybridization may be calculated using the following equation: 

T m = 79.8°C + (18.5 x Log[Na+]) 
40 + (58.4°C x %[G+C]) 
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(820 / #bp in duplex) 

(0.5 x % formamide) 

Promega Protocols and Applications Guide . 2d ed., Promega Corp., Madison, WI 
(1991), which is hereby incorporated by reference. Nonspecific binding may also 
5 be controlled using any one of a number of known techniques such as, for 

example, blocking the membrane with protein-containing solutions, addition of 
heterologous RNA, DNA, and SDS to the hybridization buffer, and treatment with 
RNase. 

Generally, suitable stringent conditions for nucleic acid 

1 0 hybridization assays or gene amplification detection procedures are as set forth 

above or as identified in Southern, "Detection of Specific Sequences Among DNA 
Fragments Separated by Gel Electrophoresis," J. Mol. Biol. . 98:5 03-17 (1975), 
which is hereby incorporated by reference. For example, conditions of 
hybridization at 42°C with 5X SSPE and 50% formamide with washing at 50°C 

1 5 with 0.5X SSPE can be used with a nucleic acid probe containing at least 20 

bases, preferably at least 25 bases or more preferably at least 30 bases. Stringency 
may be increased, for example, by washing at 55°C or more preferably 60°C using 
an appropriately selected wash medium having an increase in sodium 
concentration (e.g., IX SSPE, 2X SSPE, 5X SSPE, etc.). If problems remain with 

20 cross-hybridization, further increases in temperature can also be selected, for 
example, by washing at 65°C, 70°C, 75°C, or 80°C. By adjusting hybridization 
conditions, it is possible to identify sequences having the desired degree of 
homology (i.e., greater than 80%, 85%, 90%, or 95%) as determined by the 
TBLASTN program (Altschul, S.F., et al., "Basic Local Alignment Search Tool," 

25 J. Mol. Biol. 215:403-410 (1990), which is hereby incorporated by reference) on 
its default setting. 

A preferred method of detecting the mutant acid 
phosphatase/phytase of the present invention is by using the methods known in the 
art as ligase detection reaction (LDR) and ligase chain reaction (LCR), as 
30 described in Barany, "Genetic Disease Detection and DNA Amplification Using 
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Cloned Thermostable Ligase Proc. Natl. Acad. Sci. USA 88(1): 1 89-193 (1991), 
which is hereby incorporated by reference. 

The DNA molecule of the present invention can be expressed in 
any prokaryotic or eukaryotic expression system by incorporation of the DNA 
5 molecule in the expression system in proper orientation and correct reading frame. 
A variety of host-vector systems may be utilized to express the protein-encoding 
sequence(s). Preferred vectors include a viral vector, plasmid, cosmid or an 
oligonucleotide. Primarily, the vector system must be compatible with the host 
cell used. Host-vector systems include but are not limited to the following: 

1 0 bacteria transformed with bacteriophage DNA, plasmid DNA, or cosmid DNA; 
microorganisms such as yeast containing yeast vectors; mammalian cell systems 
infected with virus (e.g., vaccinia virus, adenovirus, etc.); insect cell systems 
infected with virus (e.g., baculovirus); and plant cells infected by bacteria. The 
expression elements of these vectors vary in their strength and specificities. 

15 Depending upon the host-vector system utilized, any one of a number of suitable 
transcription and translation elements can be used. For example, a DNA molecule 
in accordance with the present invention is spliced in frame with a transcriptional 
enhancer element. 

Preferred hosts for expressing the DNA molecule of the present 
20 invention include fungal cells, including species of yeast or filamentous fungi, 
may be used as host cells in accordance with the present invention. Preferred 
yeast host cells include different strains of Saccharomyces cerevisiae. Other 
yeasts like Kluyveromyces, Torulaspora, and Schizosaccharomyces can also be 
used. In a preferred embodiment, the yeast strain used to overexpress the protein 
25 is Saccharomyces cerevisiae. Preferred filamentous fungi host cells include 

Aspergillus and Neurospora. A more preferred strain of Aspergillus is Aspergillus 
niger. 

In another preferred embodiment of the present invention, the yeast 
strain is a methylotrophic yeast strain. Methylotrophic yeast are those yeast 
30 genera capable of utilizing methanol as a carbon source for the production of the 
energy resources necessary to maintain cellular function and containing a gene for 
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the expression of alcohol oxidase. Typical methylotrophic yeasts include 
members of the genera Pichia, Hansenula y Torulopsis, Candida, and Karwinskia. 
These yeast genera can use methanol as a sole carbon source. In a more preferred 
embodiment, the methylotrophic yeast strain is Pichia pastoris. 

5 Purified protein may be obtained by several methods. The protein 

or polypeptide of the present invention is preferably produced in purified form 
(preferably at least about 80%, more preferably 90%, pure) by conventional 
techniques. Typically, the protein or polypeptide of the present invention is 
secreted into the growth medium of recombinant host cells. Alternatively, the 

10 protein or polypeptide of the present invention is produced but not secreted into 
growth medium. In such cases, to isolate the protein, the host cell carrying a 
recombinant plasmid is propagated, lysed by sonication, heat, or chemical 
treatment, and the homogenate is centrifuged to remove cell debris. The 
supernatant is then subjected to sequential ammonium sulfate precipitation. The 

15 fraction containing the polypeptide or protein of the present invention is subjected 
to gel filtration in an appropriately sized dextran or polyacrylamide column to 
separate the proteins. If necessary, the protein fraction may be further purified by 
HPLC. 

The present invention also provides a yeast strain having a 
20 heterologous gene which encodes a protein or polypeptide with phytase activity. 
The heterologous gene should be functionally linked to a promoter capable of 
expressing phytase in yeast. 

Yet another aspect of the invention is a vector for expressing 
phytase in yeast. The vector carries a gene from a non-yeast organism which 

25 encodes a protein or polypeptide with phytase activity. The phytase gene can be 
cloned into any vector which replicates autonomously or integrates into the 
genome of yeast. The copy number of autonomously replicating plasmids, e.g. 
YEp plasmids, may be high, but their mitotic stability may be insufficient (Bitter 
et al., "Expression and Secretion Vectors for Yeast," Meth. Enzvmol. 153:516-44 

30 (1 987), which is hereby incorporated by reference). They may contain the 2 
mu-plasmid sequence responsible for autonomous replication, and an £. coli 
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sequence responsible for replication in E. coli. The vectors preferably contain a 
genetic marker for selection of yeast transformants, and an antibiotic resistance 
gene for selection in E. coli. The episomal vectors containing the ARS and CEN 
sequences occur as a single copy per cell, and they are more stable than the YEp 
5 vectors. Integrative vectors are used when a DNA fragment is integrated as one or 
multiple copies into the yeast genome. In this case, the recombinant DNA is 
stable and no selection is needed (Struhl et al., "High-Frequency Transformation 
of Yeast: Autonomous Replication of Hybrid DNA Molecules Proc. Nat'l Acad. 
Sci. USA 76:1035-39 (1979); Powels et al. s Cloning Vectors. I-IV. et sea. 

10 Elsevier, (1985); and Sakai et al., "Enhanced Secretion of Human Nerve Growth 
Factor from Saccharomyces Cerevisiae Using an Advanced 5-Integration 
System," Biotechnology 9:1382-85 (1991), which are hereby incorporated by 
reference). Some vectors have an origin of replication, which functions in the 
selected host cell. Suitable origins of replication include 2|i, ARS1, and 25^iM. 

1 5 The vectors have restriction endonuclease sites for insertion of the fusion gene and 
promoter sequences, and selection markers. The vectors may be modified by 
removal or addition of restriction sites, or removal of other unwanted nucleotides. 

The phytase gene can be placed under the control of any promoter 
(Stetler et al., "Secretion of Active, Full- and Half-Length Human Secretory 

20 Leukocyte Protease Inhibitor by Saccharomyces cerevisiae" Biotechnology 
7:55-60, (1989), which is hereby incorporated by reference). One can choose a 
constitutive or regulated yeast promoter. Suitable promoter sequences for yeast 
vectors include, among others, promoters for metallothionein, 3-phosphoglycerate 
kinase (Hitzeman et al., J. Biol. Chem . 255:2073 (1980), which is hereby 

25 incorporated by reference) or other glycolytic enzymes (Hess et aL, J. Adv. 

Enzvme Reg. 7:149 (1968); and Holland et al., Biochem. 17:4900, (1978), which 
are hereby incorporated by reference), such as enolase, glyceraldehyde-3- 
phosphate dehydrogenase, hexokinase, pyruvate decarboxylase, 
phosphofructokinase, glucose-6-phosphate isomerase, 3-phosphoglycerate mutase, 

30 pyruvate kinase, triosephosphate isomerase, phosphoglucose isomerase, and 

glucokinase. Other suitable vectors and promoters for use in yeast expression are 
further described in EP A-73,657 to Hitzeman, which is hereby incorporated by 
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reference. Another alternative is the glucose-repressible ADH2 promoter 
described by Russell et al., J. Biol. Chem. 258:2674 (1982) and Beier et al., 
Nature 300:724 (1982), which are hereby incorporated by reference. 

One can choose a constitutive or regulated yeast promoter. The 
5 strong promoters of e.g., phosphoglycerate kinase (PGK) gene, other genes 

encoding glycolytic enzymes, and the alpha -factor gene, are constitutive. When a 
constitutive promoter is used, the product is synthesized during cell growth. The 
ADH2 promoter is regulated with ethanol and glucose, the GAL-1 -10 and GAL7 
promoters with galactose and glucose, the PH05 promoter with phosphate, and 

10 the metallothionine promoter with copper. The heat shock promoters, to which 
the HSP1 50 promoter belongs, are regulated by temperature. Hybrid promoters 
can also be used. A regulated promoter is used when continuous expression of the 
desired product is harmful for the host cells. Instead of yeast promoters, a strong 
prokaryotic promoter such as the T7 promoter, can be used, but in this case the 

1 5 yeast strain has to be transformed with a gene encoding the respective polymerase. 
For transcription termination, the HSP1 50 terminator, or any other functional 
terminator is used. Here, promoters and terminators are called control elements. 
The present invention is not restricted to any specific vector, promoter, or 
terminator. 

20 The vector may also carry a selectable marker. Selectable markers 

are often antibiotic resistance genes or genes capable of complementing strains of 
yeast having well characterized metabolic deficiencies, such as tryptophan or 
histidine deficient mutants. Preferred selectable markers include URA3, LEU2, 
HIS3, TRP1 , HIS4, ARG4, or antibiotic resistance genes. 

The vector may also have an origin of replication capable of 
replication in a bacterial cell. Manipulation of vectors is more efficient in 
bacterial strains. Preferred bacterial origin of replications are ColEl, Ori, or oriT. 

Preferably, the protein or polypeptide with phytase activity is 
secreted by the cell into growth media. This allows for higher expression levels 
and easier isolation of the product. The protein or polypeptide with phytase 
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activity is coupled to a signal sequence capable of directing the protein out of the 
cell. Preferably, the signal sequence is cleaved from the protein. 

A leader sequence either from the yeast or from phytase genes or 
other sources can be used to support the secretion of expressed phytase enzyme 
5 into the medium. The present invention is not restricted to any specific type of 
leader sequence or signal peptide. 

Suitable leader sequences include the yeast alpha factor leader 
sequence, which may be employed to direct secretion of the phytase. The alpha 
factor leader sequence is often inserted between the promoter sequence and the 

1 0 structural gene sequence (Kurjan et al., Cell 30:933, ( 1 982); Bitter et al., Proc. 
Natl. Acad. Sci. USA 81:5330, (1984); U.S. Patent No. 4,546,082; and European 
published patent application No. 324,274, which are hereby incorporated by 
reference). Another suitable leader sequence is the S. cerevisiae MF alpha 1 
(alpha-factor) is synthesized as a prepro form of 1 65 amino acids comprising 

1 5 signal-or prepeptide of 1 9 amino acids followed by a "leader" or propeptide of 64 
amino acids, encompassing three N-linked glycosylation sites followed by 
(LysArg(Asp/Glu, Ala)2-3 alpha-factor)4 (Kurjan, et al., Cell 30:933-43 (1982), 
which is hereby incorporated by reference). The signal-leader part of the 
preproMF alpha 1 has been widely employed to obtain synthesis and secretion of 

20 heterologous proteins in S. cerivisiae. Use of signal/leader peptides homologous 
to yeast is known from: U.S. Patent No. 4,546,082; European Patent Applications 
Nos. 116,201, 123,294, 123,544, 163,529, and 123,289; and DK Patent 
Application No. 3614/83, which are hereby incorporated by reference. In 
European Patent Application No. 123,289, which is hereby incorporated by 

25 reference, utilization of the S. cerevisiae a-factor precursor is described whereas 
WO 84/01 153, which is hereby incorporated by reference, indicates utilization of 
the Saccharomyces cerevisiae invertase signal peptide, and German Patent 
Application DK 3614/83, which is hereby incorporated by reference, indicates 
utilization of the Saccharomyces cerevisiae PH05 signal peptide for secretion of 

30 foreign proteins. 
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The alpha -factor signal-leader from Saccharomyces cerevisiae 
(MF alpha 1 or MF alpha 2) may also be utilized in the secretion process of 
expressed heterologous proteins in yeast (U.S. Patent No. 4,546,082; European 
Patent Applications Nos. 16,201, 123,294, 123,544, and 163,529, which are 
5 hereby incorporated by reference). By fusing a DNA sequence encoding the S. 
cerevisiae MF alpha 1 signal/ leader sequence at the 5' end of the gene for the 
desired protein, secretion and processing of the desired protein was demonstrated. 
The use of the mouse salivary amylase signal peptide (or a mutant thereof) to 
provide secretion of heterologous proteins expressed in yeast has been described 
1 0 in Published PCT Applications Nos. WO 89/02463 and WO 90/1 0075, which are 
hereby incorporated by reference. 

U.S. Patent No. 5,726,038 describes the use of the signal peptide of 
the yeast aspartic protease 3, which is capable of providing improved secretion of 
proteins expressed in yeast. Other leader sequences suitable for facilitating 
15 secretion of recombinant polypeptides from yeast hosts are known to those of 
skill in the art. A leader sequence may be modified near its 3' end to contain one 
or more restriction sites. This will facilitate fusion of the leader sequence to the 
structural gene. 

Yeast transformation protocols are known to those of skill in the 
20 art. One such protocol is described by Hinnen et ah, Proc. Natl. Acad. Sci. USA 
75:1929 (1978), which is hereby incorporated by reference. The Hinnen etal. 
protocol selects for Trp transformants in a selective medium, wherein the selective 
medium consists of 0.67% yeast nitrogen base, 0.5% casamino acids, 2% glucose, 
1 0 |ig/ml adenine and 20 |ig/ml uracil. 

25 The gene may be maintained in a stable expression vector, an 

artificial chromosome, or by integration into the yeast host cell chromosome. 

Integration into the chromosome may be accomplished by cloning the phytase 

> 

gene into a vector which will recombine into a yeast chromosome. Suitable 
vectors may include nucleotide sequences which are homologous to nucleotide 
30 sequences in the yeast chromosome. Alternatively, the phytase gene may be 
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located between recombination sites, such as transposable elements, which can 
mobilize the gene into the chromosome. 

Another aspect of the present invention relates to improving the 
enzymatic properties of a wild-type acid phosphatase/phytase. This is desirably 
5 achieved by altering the amino acid sequence of the wild-type acid 

phosphatase/phytase at positions 200, 207, and 21 1 as described above. For 
example, these modifications cause the acid phosphatase/phytase to have 
improved thermostability. Alternatively, the improved enzymatic property is 
phytase activity at a pH range of between about pH 3.5 to about pH 5.5. 

10 While the phytase enzyme produced in a yeast system released 

phytate-P from corn and soy as effectively as the currently commercial phytase, it 
appeared to be more thermostable. This phytase overexpression system in yeast 
can be used to provide thermostable phytase for use in the food and feed 
industries. 

1 5 The improved acid phosphatase/phytase of this invention can be 

used in animal feed to improve the digestion of phosphate by such simple- 
stomached animals as poultry, swine, pre-ruminant calves, zoo animals, and pets 
(e.g., cats and dogs). The present invention would decrease the need for 
supplementing animal feed with large amounts of inorganic phosphate, resulting 

20 in a less expensive form of animal feed and one that is less concentrated with the 
non-renewable form of phosphate. Since the present invention enhances the 
ability of simple-stomached animals to absorb phosphate, the fecal waste of these 
animals will contain less unutilized phytate-phosphate, which decreases the 
amount of phosphate pollution. 

25 In making the animal feed composition of the present invention, 

the mutant acid phosphatase/phytase is combined with a raw plant material and 
then processed into a pellet or powder form. The raw plant material may include 
various combinations of a number of plants and/or plant by-products commonly 
used in animal feed, including plants such as maize, soybean, wheat, rice, cotton 

30 seed, rapeseed, sorghum, and potato. In addition, the animal feed composition 
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may be fortified with various vitamins, minerals, animal protein, and antibiotics. 
One embodiment of the animal feed composition includes a mixture of 
appropriate concentrations of the mutant acid phosphatase/phytase, an energy 
source(s) (e.g., maize, wheat), a protein source(s) (e.g., soybean, rice, cottonseed 
5 meal, rapeseed meal, sorghum meal), and vitamin/mineral supplements. In 

particular, the amount of the mutant acid phosphatase/phytase would be 300-1 ,000 
Units/kg of feed. One example of a typical animal feed composition would 
include 50-70% maize, 20-30% soybean, approximately 1% vitamin and mineral 
supplements, and an appropriate amount of mutant acid phosphatase/phytase. 

10 In addition, the mutant acid phosphatase/phytase of the present 

invention could be used to enhance human nutrition, particularly by increasing the 
uptake of such minerals as zinc and iron. By adding the mutant acid 
phosphatase/phytase to the diets of humans, various problems arising from 
nutrient deficiencies, such as stunted growth and mental retardation in children, 

1 5 could be treated and avoided. 

The invention also provides a basic molecular method that can be 
broadly applied to design mutant acid phosphatases/phytases derived from various 
source organisms, resulting in mutants with enhanced enzymatic properties such 
as greater thermostability and catalytic efficiency. This method includes 

20 identifying and isolating a gene of a wild-type enzyme and using this gene as the 
object of site-directed mutagenesis in order to enhance the enzyme's function 
and/or stability. One aspect of this invention is to use site-directed mutagenesis to 
make targeted mutations to the wild-type gene in order to add N-glycosylation 
sites to the wild-type enzyme and/or to alter the enzyme's physiochemical 

25 properties (e.g., increasing the net positive charge of the enzyme). In addition, 
targeted mutations can be made to the wild-type gene in order to eliminate certain 
disulfide bonds found in the final protein product, resulting in enhanced 
thermostability and catalytic function. 
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EXAMPLES 

Example 1 - Sequence Analysis for Designing Mutations 

The criteria for designing mutations to enhance glycosylation of 
5 the AppA enzyme were 1) the potential glycosylation site should have 25% or 
greater solvent accessibility, and 2) the site should be easily engineered by a 
single residue change to give an N-linked glycosylation motif (Asn-X-Ser or Asn- 
X-Thr, where X is not a proline). Initially, in the absence of a crystal structure for 
the AppA enzyme, the crystal structure of rat acid phosphatase (35% sequence 

10 identity) (Schneider, G. et al., EMBO J. 12:2609-15 (1993), which is hereby 

incorporated by reference) was used to calculate accessibilities as follows. First, 
the AppA enzyme and rat acid phosphatase were aligned to several closely related 
phosphatases/phytases using the multi-sequence alignment program PIMA (Smith, 
R et al., Protein Engineering 5:35-41 (1992), which is hereby incorporated by 

15 reference). The aligned sequences included: human prostatic acid phosphatase 
precursor (GeneBank Accession No. PI 5309); Caenorhabditis elegam histidine 
acid phosphatase (GeneBank Accession No. Z6801 1); Aspergillus fumigatus 
phytase (GeneBank Accession No. U59804); Pichia angusta repressive acid 
phosphatase (GeneBank Accession No. AF051 161 1); rat acid phosphatase 

20 (GeneBank Accession No. 576257), and E. coli appA (GeneBank Accession 

No. M58708). Next, the solvent accessible surface of all of the amino acids of rat 
phosphatase was determined using the program DSSP (definition of secondary 
structure of proteins) (Kabsch, W. et al., Biooolvmers 22:2577-637 (1983), which 
is hereby incorporated by reference), converting these values to percent 

25 accessibility by dividing the total surface area of the corresponding amino acid as 
it has been previously described (Eisenberg, D. et al., Chemica Scrinta 29A, 217- 
221 (1989), which is hereby incorporated by reference). Only residues greater 
than 25% solvent were considered accessible. Values were assigned to the 
corresponding amino acids in the AppA enzyme based on the sequence alignment 

30 described above, under the assumption that the overall structure of rat acid 
phosphatase and the AppA enzyme would be conserved. Finally, the putative 
solvent accessible residues were examined to determine which could be easily 
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converted to an N-glycosylation site by point mutation. Out of 3 1 potential sites, 
5 were selected that best fit the desired criteria. An additional mutation C200N 
was incorporated using primer P2 designed for another appA mutagenesis study. 
From the alignment performed, the mutation C200N is in a gapped region and 
5 C200 is involved with C2 1 0 (labeled as C 1 78/C 1 88 by Lim et al., Nat. Struct. 
Biol. 7: 108-13 (2000), which is hereby incorporated by reference) in forming a 
unique disulfide bond between helix G and the GH loop (an unorganized 
configuration between the G and H helices) in the a-domain of the protein (Lim et 
al., Nat. Struct. Biol. 7: 108-13 (2000), which is hereby incorporated by reference). 
10 Correspondingly, 6 PCR primers were designed: E2 and K2 for amplifying the 
wild-type sequence of appA (Dassa, J. et al., J. Bacteriol. 172:5497-500 (1990), 
which is hereby incorporated by reference) and the others for developing four 
mutants (Table 1 and Figure 1). All the primers were synthesized by the Cornell 
University Oligonucleotide Synthesis Facility (Ithaca, NY). 
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TABLE 1 

Modified primers and index of surface solvent accessibility for mutations 



Primer 1 Posi- Primer sequence' 
tion 2 



Modifica- Access- 
tion 4 ibility 5 
(%) 



E2(f) 241- 5' GGAATTCGCTCAGAGCCGGA 3' 
264 (SEQ. ID. No. 5) 

Al (r) 565- 5' CTGGGTATGGTTGGTTATATTACAG 
592 TCAGGT3' 
(SEQ. ID. No. 6) 



P2 (0 772- 5' CAAACTTGAACCTTAAACGTGAG 3' 
795 (SEQ. ID. No. 7) 

P3 (r) 796- 5' CCTGCGTTAAGTTACAGCTTTCATT 
825 CTGTTT3' 

(SEQ. ID. No. 8) 



K2 (r) 1469- 5' GGGGTACCTTACAAACTGCACG 3 1 
1491 (SEQ. ID. No. 9) 



EcoK\ 

restriction 
site 

A131N 1.05 



V134N 0.55 



C2O0N nd 



D207N 0.63 



S211N 0.65 

Kpn\ 

restriction 
site 



': f, forward; r, reverse. 

2 : Nucleotide position based on the E.coli periplasmic pH 2.5 acid phosphatase (GeneBank 
Accession No. M58708). 

3 : Underlined nucleotides were substituted. 

4 : Amino-acid mutation or restriction site added. The coding region starts at the codon20 and 
ends at the codon 432. Amino acids A131, V134, C200, D207, and S21 1 are IabeledA109, 
VI 12, CI 78, D185, and SI 89 by Lim et al. (Lim et al., Nat. Struct. Biol. 7:108-13 (2000), ' 
which is hereby incorporated by reference). 

5 : Percentage of amino acid surface solvent accessibility (Smith, R. et ah, Protein Engineering , 
5:35-41 (1992); Kabsch, W. et al., Biopolvmers 22:2577-637 (1983), which are hereby 
incorporated by reference); nd, not determined. 



5 Example 2 - Construction of Mutants by PCR 

The E. coli appA mutants were constructed using the megaprimer 
site-directed mutagenesis method adapted from previous studies (Seraphin, B. et 
al., Nucl. Acids Res. 24:3276-77 (1996); Smith, A.M. et al., BioTechniaues 
22:438-39 (1997), which are hereby incorporated by reference). To amplify the 
1 0 intact coding region of appA, the PCR was set up in a 50 jlx! final volume 
containing 200 ng DNA of appA inserted in a pAPPAl plasmid isolated from 
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E. coli strain BL21 (Dassa, J. et al., J. Bacteriol. 172:5497-500 (1990), which is 
hereby incorporated by reference), 50 pmol of each primer E2 and K2, 5 U of 
AmpliTaq DNA polymerase (Perkin Elmer, Norwalk, CT), 10 mM Tris-HCl pH 
8.3, 50 mM KC1, 12.5 mM MgC12, and 200 mM each dNTPs (Promega Corp., 
5 Madison, WI). The reaction was performed using the GeneAmp PCR system 
2400 (Perkin Elmer), and included 1 cycle at 94°C (3 min), 30 cycles of [94°C 
(0.5 min), 54°C (1 min) and 72°C (1.5 min) ] and 1 cycle at 72°C (10 min). 
Megaprimers for mutants were produced in a separated round of PCR (Table 2). 

10 TABLE 2 

E. coli appA mutant denomination and construction 

Construct 1 Size bp No. glycosylation 

Mutants R E2A1P3K2 1350 7 

U E2P2P3K2 1350 5 

Y E2A1P2P3K2 1350 7 

Wild r-AppA E2K2 1350 3 

type 



] See Table 1 for primer denomination. 

The first mutagenic PCR reaction (100 |il) was performed as 
described above, using 4 [il of the intact appA PCR reaction mixture and the 

1 5 respective modified primers listed in Table 1 . All megaprimer PCR products were 
resolved in a 1.5% low melting agarose (Gibco BRL, Grand Island, NY) gel 
electrophoresis. The expected fragments were excised and eluted with 
GENECLEAN II kit (Bio 101, Vista, CA). The final mutagenic PCR reaction (100 
|il) was set up as described above, using 4 ^il of the appA PCR product and 

20 varying concentrations of the purified megaprimer (50 ng to 4 jig), depending on 
its size. Five thermal cycles were set up at 94°C for 1 min and 70°C for 2 min. 
While at 70°C, 1 ^mol of forward primer and 2 U of AmpliTaq DNA polymerase 
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were added and gently mixed with the reaction, and thermal cycling continued for 
25 times at 94°C for 1 min, 56°C for 1 min and 70°C for 1 .5 min. 

Example 3 - Subcloning and Expression 

5 E. coli strain TOP 1 OF' (Invitrogen, San Diego, C A) was used as an 

initial host. The PCR fragments were purified and cloned into pGEMT-Easy 
vector (Promega) according to the manufacturer's instructions. EcoRl digestion 
of the isolated plasmid DNA was used to screen for positive transformants. The 
resulting inserts were cloned into pPICZctA (Kit Easy-Select, Invitrogen) at the 

10 EcoRl site and transformed into TOPI OF' cells plated on LB (Luria-Bertani) 

medium containing 25 |ig/ml Zeocin. Colonies with desired inserts in the correct 
orientations were selected using Sail or BstXl restriction digestions of plasmid 
DNA. P. pastoris strain X33 (Mut+ His+) was used as the host for protein 
expression (Invitrogen) and grown in YPD (yeast extract peptone dextrose 

1 5 medium) liquid medium prior to electroporation. Two \ig of plasmid DNA were 
linearized using restriction enzyme BgtYL or Pmel and then transformed into X33 
according to the manufacturer's instructions (Invitrogen). After selected 
transformants were incubated in minimal media with glycerol (GMGY) for 24h, 
0.5% methanol medium (GMMY) was used to induce protein expression. 

20 

Example 4 - Enzyme Purification and Biochemical Characterization 

The expressed r-AppA and mutant enzymes in the medium 
supernatant were subjected to a two-step ammonium sulfate precipitation (25% 
and 75%) as previously described (Rodriguez, E. et al., Biochem. Biophvs. Res. 

25 Commun. 257: 1 1 7-23 ( 1 999), which is hereby incorporated by reference). The 
suspension of the first round was centrifuged at 4°C, 25,000 x g for 20 min. The 
pellet of the second round was suspended in 10 ml and dialyzed overnight against 
25 mM Tris-HCl, pH 7. After dialysis, the protein extract was loaded onto a 
DEAE (diethylaminoethyl)-Sepharose column (Sigma, St. Louis, MO) 

30 equilibrated with 25 mM Tris-HCl, pH 7. The bound protein was eluted with 1 M 
NaCl in 25 mM Tris-CHl, pH 7. Those three fractions exhibiting the highest 
activities were pooled and dialyzed against 25 mM Tris-HCl, pH 7.5 for the 
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following analysis. Phytase activity was measured using sodium phytate as the 
substrate (Rodriguez, E, et al., Biochem. Biophvs. Res. Commun, 257:1 17-23 
(1999); Piddington, C.S. et al., Gene 133:55-62 (1993), which are hereby 
incorporated by reference). The enzyme was diluted in 0.25 M glycine-HCl, pH 
5 2.5, and an equal volume of substrate solution containing 1 1 mM sodium phytate 
(Sigma) was added. After incubation of the sample for 15 min at 37°C, the 
reaction was stopped by addition of an equal volume of 15% trichloroacetic acid. 
Free inorganic phosphorus was measured at 820 nm after 0.2 ml of the sample 
was mixed with 1.8 ml of H 2 0 and 2 ml of a solution containing 0.6 M H 2 S0 4 , 2% 

10 ascorbic acid, and 0.5% ammonium molybdate, followed by incubation for 20 min 
at 50°C. One phytase unit was defined as the amount of activity that releases 1 
jimol of inorganic phosphorus from sodium phytate per minute at 37°C. The final 
concentrations of sodium phytate used for the enzyme kinetics were: 0.1, 0.25, 
0.5, 0.75, 1, 2.5, 10, and 25 mM. Acid phosphatase activity was assayed using 

1 5 pNPP (Sigma) at a final concentration of 25 mM (Smith, R. et al., Protein 

Engineering 5:35-41 (1992), which is hereby incorporated by reference). To 50 \x\ 
of enzyme (40 nmol), 850 ^1 of 250 mM glycine-HCl, pH 2.5, were added. After 
5 min of incubation at 37°C, 1 00 |il of pNPP was added. The released p- 
nitrophenol was measured at 405 nm after 0.1 ml of the sample was mixed with 

20 0.9 ml of 1 M NaOH and incubated for 10 min. The final concentrations of pNPP 
used for the enzyme kinetics were: 0.1, 0.2, 0.75, 1, 2.5, 10, and 25 mM. One 
unit of acid phosphatase/phytase activity was defined as the amount of enzyme 
catalyzing the formation of 1 nmol of /?-nitrophenol per minute. Before the 
thermostability assay, the enzyme (2 mg/ml) was diluted 1 :400 in 0.2 M glycine- 

25 HC1, pH 2.5. The diluted samples were incubated for 15 min at 25, 55, 80, and 
90°C. After the samples were cooled on ice for 30 min, their remaining phytase 
activies were measured as described above. Deglycosylation of purified enzymes 
was done by incubating 100 |ig of total protein with 0.5 IU endoglycosidase H/ 
(Endo Hf) for 4 h at 37°C according to the manufacturer's instructions (New 

30 England Biolabs, Beverly, MA). Sodium dodecyl sulfate-polyacrylamide gel 
electrophoresis (SDS-PAGE), 1 5% (w/v) gel was performed as previously 
described (Laemmli, U.K., Nature 227:680-85 (1970), which is hereby 
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incorporated by reference). Protein concentrations were determined using the 
Lowry method (Lowry, O.H. et al., J. Biol. Chem. 193:265-75 (1951), which is 
hereby incorporated by reference). 

5 Data were analyzed using SAS (release 6.04, SAS institute, Cary, 

NC, USA). 

Example 5 - Effects of Site-Directed Mutagenesis on Phytase Expression 

and Glycosylation 

10 

Genomic DNA from each yeast transformant was extracted to 
amplify the desired mutated appA by PCR (polymerase chain reaction) using E2 
and K2 primers. All the desired mutations were confirmed by sequencing. For 
each mutant, 24 colonies were analyzed for phytase activity at various times after 

15 induction. All of the three mutants, Mutant R, Mutant U, and Mutant Y, along 
with r-AppA, were expressed and secreted, resulting in a time-dependent 
accumulation of extracellular phytase activity that reached plateau at 96 h after 
methanol induction. The plateau activity in the medium supernatant was 35, 1 75, 
57, and 1 1 7 U/mL, respectively (Table 3). Yeast X33 transformed with the 

20 expression vector pPICZaA was used as a control and did not give any activity or 
phytase protein in SDS-PAGE. On the purified protein basis, Mutant U had the 
highest specific phytase activity, 63 U/mg, followed by Mutant Y, r-AppA and 
• Mutant R (51, 41 and 32 U/mg protein, respectively). The protein yield recovered 
after purification was 654, 324, 688 and 425 mg/L for the Mutants U and Y, r- 

25 AppA and Mutant R, respectively (Table 3). 
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TABLE 3 

Phytase yield and specific activity of r-AppA and the three mutants 



Protein Phytase activity 1 Protein yield 2 Specific activity 3 









-Endo H f 


+Endo H f 


r-AppA 


117± 15 


688 ±44 


41 ±3 


37±4 


R 


35±4 


425 ± 26 


32 ±2 


29±2 


U 


175 ±19 


654 ± 39 


63 ±4* 


65 ±5* 


Y 


57 ±8 


324 ±18 


51 ± 5 


46 ±6 



': Phytase activity (U/ml) in GMMY media after 96 h of culture. 

2 : Protein yield (milligrams of purified protein per liter of culture). 

3 : Specific phytase activity (units per milligram of purified protein). 

* Indicates significant difference (P < 0.05) versus the r-AppA control. Results are 
representative of three experiments. 



In SDS-PAGE, the band size of the purified r-AppA was 50-56 
kDa, while that of Mutant R was 68-70 kDa and that of Mutant Y was 86-90 kDa 
(Figure 2). This gave an enhancement of the glycosylation level from 14% in r- 
AppA to 48% in Mutant R and 89% in Mutant Y. The level of glycosylation in 
Mutant U appeared equivalent to that of r-AppA. All of these recombinant 
enzymes showed similar molecular mass, 45 to 48 kDa, after deglycosylation by 
Endo H/. Deglycosylation did not significantly affect the specific activity for all 
the mutants or r-AppA (Table 3). However, treating these purified proteins with 
both P-mercaptoethanol and Endo Hy caused a complete loss of phytase activity. 



5 



10 



15 
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Examnle 6 - Effects of Site-Directed Mutagenesis on Phytase pH and 

Temperature Optima and Thermostability 

Although Mutants R, U, and Y shared the same pH optimum (2.5) 
5 with that of r-AppA, Mutant U was more (p<0.05), while Mutant Y was less 
(p<0.05), active than r-AppA at the pH 3.5, 4.5, and 5.5 (Figure 3). The 
temperature optimum was 65°C for Mutant U and 55°C for the other two mutants 
and r-AppA. In 0.2 M glycine-HCl, pH 2.5, Mutant U exhibited a higher (p<0.05) 
residual phytase activity than that of r-AppA after being heated at 80 and 90°C for 
10 15 min (Figure 4). 

Example 7 - Effects of Site-Directed Mutagenesis on Enzyme Kinetics 

The K m value for pNPP (p-nitrophenyl phosphate) was reduced by 
one-half and the one for sodium phytate by 70% with Mutant U, versus r-AppA {P 

1 5 < 0.05) (Table 4). Consequently, Mutant U demonstrated a 1 .9-fold increase in its 
apparent catalytic efficiency k ca /K m for pNPP and a 5.2-fold increase for sodium 
phytate than that of r-AppA. Although the k ca /K m values for Mutant Y were also 
significantly different from those of r-AppA for sodium phytate, the actual 
enhancement was relatively small. In contrast, Mutant R demonstrated a 

20 significantly lower catalytic efficiency than that of r-AppA for both substrates. 
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TABLE 4 

Catalytic properties of r-AppA and the three mutants' 



Substrate 



10 



15 



20 



Enzyme 




pNPP 






Na-Phytate 




Km 

(mM) 


^cat 

(min" 1 ) 


k ca /K m 
(min 1 M* 1 ) 


Km 

(mM) 


k cat 
(min* 1 ) 


(min 1 M" 1 ) 


r-AppA 


3.66 ± 
0.44 


752 ± 
7.9 


(2.0 ± 0.18) 
x 10 3 


1.95 + 
0.25 


21481 
33 


(1.11 10.13) 
x 10 6 


R 


7.87 + 
0.84* 


390 ± 
5.9* 


(0.5 ± 0.07) 
xlO 5 * 


3.071 
0.26* 


1657 + 
23* 


(0.54 10.09) 
x!0 6 * 


U 


1.86 + 
35* 


1073 ± 
13* 


(5.8±0.37) 
xlO 5 * 


0.58 + 
0.08* 


4003 + 
56* 


(6.9010.70) 
xlO 6 * 


Y 


3.18 ± 
0.39 


787 ± 
6.7 


(2.510.17) 
x 10* 


2.03 + 
0.19 


3431 + 
41* 


(1.6910.21) 
xlO 6 ' 



1 Reaction velocity measurements were performed in triplicate as described herein. The 
values of K m were calculated using the Lineweaver-Burk plot method. All reactions were 
measured in 0.25 M glycine-HCl, pH 2.5. 

* Indicates significant difference (P < 0.05) versus the r-AppA control . Results are 
representative of five independent experiments. 



The above results indicate that additional N-glycosylation sites 
and/or other amino acid changes can be added to the AppA enzyme by 
site-directed mutagenesis. Compared with the r-AppA produced by the intact 
appA gene, the mutant enzymes R and Y clearly demonstrated enhanced 
glycosylation, as shown by their differences in molecular masses before and after 
deglycosylation. Thus, the engineered N-glycosylation sites in these two mutants 
were indeed recognized by P. pastoris and processed correctly. Because of the 
multiple mutations in Mutants R and Y, these results cannot assess the level of 
glycosylation at specific engineered sites, but useful information can be derived 
by comparisons between the mutants and r-AppA. First, although both Mutants R 
and Y had four additional N-glycosylation sites with respect to r-AppA, Mutant Y 
displayed greater than 40% more N-glycosylation than R (89% vs 48%). Because 
the substitution C200N in Mutant Y was the only difference between these two 
variants and that mutation added no additional putative N-glycosylation site, it 
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seems that changing C200N itself might enhance N-glycosylation at certain sites. 
Second, although Mutant U had two additional N-glycosylation sites (Asn 207 and 
Asn 211), its apparent molecular weight was the same as r-AppA, suggesting the 
two engineered glycosylation sites in Mutant U were silent. This demonstrates 
5 that although the presence of such a signal sequence is required for glycosylation, 
it does not necessarily result in glycosylation (Meldgaard, M. et al., Microbiol. 
140:159-66 (1994), which is hereby incorporated by reference). Possibly, the 
residues mutated in the case of Mutant U were not as solvent accessible as the 
structure-based sequence alignment would lead one to believe. The recently 

10 published crystal structure of the AppA enzyme may help answer this question 
(Lim et al., Nat. Struct. Biol. 7: 1 08-1 3 (2000); Jia, Z. et al., Acta Crvstallogr. D 
Biol. Crvstallogr. 54:647-49 (1998), which are hereby incorporated by reference). 
Lastly, Mutant R had a significant increase in glycosylation compared with that of 
Mutant U. The difference might be caused by the two added N-glycosylation sites 

15 at A131N and V134N in Mutant R. Given the above results, the following 
observations can be made: 1) the substitutions A131N and V134N result in 
increased glycosylation of the AppA enzyme; 2) the substitutions D207N and 
S21 IN were silent; 3) the substitution C200N appeared to enhance glycosylation 
at other sites in the case of Mutant Y, but not in Mutant U. 

20 In general, additional glycosylation of proteins has been shown to 

facilitate folding and increase stability (Haraguchi, M. et al., Biochem. J. 312:273- 
80 (1995); Imperiali, B. et al., Proc. Natl. Acad. Sci. USA. 92:97-1 12 (1995), 
which are hereby incorporated by reference). Contrary to expectations, Mutants R 
and Y did not demonstrate enhanced thermostability, despite elevated levels of 

25 glycosylation. Surprisingly, Mutant U displayed a greater thermostability despite 
having the same level of glycosylation as r-AppA. Although performing C200N 
does not mean that N-glycosylation at other sites has occurred, greater 
glycosylation at specific sites is feasible. Seemingly, the mutations per se rather 
than glycosylation had contributed to this effect. A recent study described the 

30 production of six different phytases expressed in either Aspergillus niger or the 
yeast Hamenula polymorpha (Wyss, M. et al., Appl. Environ. Microbiol. 65:359- 
66 (1999), which is hereby incorporated by reference). The results indicated that 
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levels of glycosylation depended on the host chosen, but had no significant effect 
on thermostability, specific activity or protein refolding (Wyss, M. et al., Appl. 
Environ. Microbiol. 65:359-66 (1999), which is hereby incorporated by 
reference). 

5 The kinetic data indicate that all the three mutants and r-AppA had 

lower K m and higher k ca /K m for sodium phytate than for pNPP. Clearly, these 
recombinant enzymes have higher apparent efficiency for the former than the 
latter, demonstrating that the AppA enzyme is more a phytase than acid 
phosphatase (Lim et ah, Nat. Struct. Biol. 7:108-13 (2000); Rodriguez, E. et ah, 

1 0 Biochem. Biophvs. Res. Commun. 257:11 7-23 ( 1 999), which are hereby 

incorporated by reference). Mutant U exhibited the largest enhancement in its 
apparent efficiency for both substrates over that of r-AppA. The enhancement in 
k ca /K m is most likely due to a large decrease in K m (1.86 vs 3.66 mM forpNPP 
and 0.58 vs 1 .95 mM for sodium phytate). . This means that the Mutant U is 

1 5 saturated at a much lower concentration of substrate than r-AppA. In addition, 
there was also a significant difference in k cat for both substrates between these two 
forms of phytase. Based on the structure of rat acid phosphatase (Schneider, G. et 
al, EMBO J. 12:2609-15 (1993), which is hereby incorporated by reference), 
these mutations do not seem to be involved in the enzyme active site or the 

20 formation of acid phosphatase dimer. Probably, these mutations singly or jointly 
affect the conformational flexibility of the enzyme, such as described previously 
for another protein (Kern, G. et al., Protein Sci. 2:1862-68 (1993), which is hereby 
incorporated by reference). Based on the recently solved crystal structures of E. 
coli phytase (Lim et al., Nat. Struct. Biol. 7:108-13 (2000); Jia, Z. et al., Acta 

25 Crvstalloer. D Biol. Crvstallogr. 54:647-49 ( 1 998), which are hereby incorporated 
by reference), none of these mutations are directly involved in the substrate- 
binding pocket. However, C200 and C210, labeled as CI 78 and C 1 88 by Lim et 
al. (Lim et al., Nat. Struct. Biol. 7:108-13 (2000), which is hereby incorporated by 
reference), are involved in a disulfide bond between helix G and the GH loop in 

30 the a-domain of the protein (Lim et al., Nat. Struct. Biol. 7: 1 08-1 3 (2000), which 
is hereby incorporated by reference). With the mutation C200N, the unique 
disulfide bond into the a-domain is no longer present in the GH loop. This 
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change may result in a better flexibility of the a-domain toward the central cavity 
or "substrate-binding site" of the enzyme (Lim et al., Nat. Struct. Biol 7:108-13 
(2000), which is hereby incorporated by reference). This internal flexibility may 
be also supported by the fact that Mutant U, and to a lesser extent Mutant Y, 
5 demonstrated improvement in the catalytic efficiency for sodium phytate 

hydrolysis. Since there was no enhanced glycosylation for Mutant U, engineered 
glycosylation sites N207 and N21 1, labeled as Dl 85 and S189 by Lim et al. (Lim 
et al., Nat. Struct. Biol. 7:108-13 (2000), which is hereby incorporated by 
reference), may be masked from the exposed surface. The improvement of 
1 0 thermostability for Mutant U may be therefore explained by an increasing number 
of hydrophobic interactions not presented in Mutant Y or Mutant R. 

It is worth mentioning that the specific activities of phytase in all 
the three mutants and r-AppA were not significantly affected by deglycosylation. 
However, deglycosylation, as shown in glycoprotein hormones (Terashima, M. et 

15 al., Eur. J. Biochem. 226:249-54 (1994), which is hereby incorporated by 
reference) or the Schwanniomyces occidentalis a-amylase expressed in S. 
cerevisiae (Han, Y. et al., Appl. Environ. Microbiol. 65: 1 91 5-1 8 ( 1 999), which is 
hereby incorporated by reference), may be associated with possible 
conformational changes that modulate the substrate binding and (or) the velocity 

20 of its utilization. All of the mutants and the intact control were completely 
inactivated by both P-mercaptoethanol and deglycosylation treatments. This 
suggests that the four disulfide bonds play altogether a key role in maintaining 
catalyic function of these recombinant phytases (Ullah, A.H.J, et al., Biochem. 
Biophvs. Res. Commun. 227:3 11-17 (1 996), which is hereby incorporated by 

25 reference). 

In conclusion, when the G helix and the GH loop do not contain the 
disulfide bond C200/C210 in Mutant U, the a-domain may become slightly more 
flexible, resulting in a positive modulation on the catalytic efficiency and the 
thermostability of the enzyme. Because the E. coli phytase crystal structure will 
30 be released in the near future (Lim et al., Nat. Struct. Biol. 7: 108-13 (2000), which 
is hereby incorporated by reference), more targeted mutagenesis studies should 



WO 01/36607 PCT/US00/31622 

-35- 

shed light on conformational changes that may improve the properties of the 
enzyme. 

Although preferred embodiments have been depicted and described 
in detail herein, it will be apparent to those skilled in the relevant art that various 
5 modifications, additions, substitutions, and the like can be made without departing 
from the spirit of the invention and these are therefore considered to be within the 
scope of the invention as defined in the claims which follow. 



WO 01/36607 



PCT/US00/31622 



-36- 



WHAT IS CLAIMED: 



1 . An isolated mutant acid phosphatase/phytase produced by 
making a plurality of amino acid substitutions in a wild-type Escherichia coli acid 

5 phosphatase/phytase having an amino acid sequence of SEQ. ID. No. 1, said 
amino acid substitutions comprising substitutions at positions 200, 207, and 21 1. 

2. The isolated mutant acid phosphatase/phytase according to 
claim 1 , wherein the amino acid substitution at position 200 is an Asn amino acid 
residue for a Cys amino acid residue, the amino acid substitution at position 207 is 

10 an Asn amino acid residue for an Asp amino acid residue, and the amino acid 
substitution at position 211 is an Asn amino acid residue for a Ser amino acid 
residue, said isolated mutant acid phosphatase/phytase having an amino acid 
sequence of SEQ. ID. No. 3. 

3. The isolated mutant acid phosphatase/phytase according to 
1 5 claim 1 , wherein the isolated mutant acid phosphatase/phytase is in pure form. 

4. The isolated mutant acid phosphatase/phytase according to 
claim 1, wherein the isolated mutant acid phosphatase/phytase is recombinant. 

5. A method for improving enzymatic properties of a wild- 
type Escherichia coli acid phosphatase/phytase having an amino acid sequence of 

20 SEQ. ID. No. 1, said method comprising: 

altering the amino acid sequence of said wild-type acid 
phosphatase/phytase by introducing amino acid substitutions into SEQ. ID. No. 1 
at positions 200, 207, and 211. 

6. The method according to claim 5, wherein the amino acid 
25 substitution at position 200 is an Asn amino acid residue for a Cys amino acid 

residue, the amino acid substitution at position 207 is an Asn amino acid residue 
for an Asp amino acid residue, and the amino acid substitution at position 21 1 is 
an Asn amino acid residue for a Ser amino acid residue, said amino acid 
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substitutions resulting in a mutant acid phosphatase/phytase having an amino acid 
sequence of SEQ. ID. No. 3. 

7. The method according to claim 5, wherein the improved 
enzymatic property is enhanced thermostability. 

5 8. The method of claim 5, wherein the improved enzymatic 

property is greater phytase activity at a pH range of between about pH 3.5 to about 
pH 5.5. 

9. An isolated DNA molecule encoding a mutant acid 
phosphatase/phytase according to claim 1 . 

10 10. The isolated DNA molecule according to claim 9, wherein 

the wild-type acid phosphatase/phytase is isolated from Escherichia coli. 

11. The isolated DNA molecule according to claim 10, wherein 
the DNA molecule comprises a nucleotide sequence of SEQ. ID. No. 4 or 
hybridizes to a DNA molecule comprising the nucleotide sequence of SEQ. ID. 

1 5 No. 4 under stringency conditions comprising hybridization at 42°C in a 
hybridization medium comprising 5X SSPE and 50 percent formamide with 
washing at 50°C with 0.5X SSPE. 

12. The isolated DNA molecule according to claim 9, wherein 
the amino acid substitution at position 200 is an Asn amino acid residue for a Cys 

20 amino acid residue, the amino acid substitution at position 207 is an Asn amino 
acid residue for an Asp amino acid residue, and the amino acid substitution at 
position 211 is an Asn amino acid residue for a Ser amino acid residue, said amino 
acid substitutions resulting in a mutant acid phosphatase/phytase having an amino 
acid sequence of SEQ. ID. No. 3. 

25 1 3 . A recombinant DNA expression system comprising a DNA 

molecule according to claim 9. 

14. The expression system according to claim 13, wherein the 
DNA molecule is in a heterologous expression vector. 



♦ 
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1 5. The expression system according to claim 1 3, wherein the 
DNA molecule is inserted into the expression system in proper orientation and 
correct reading frame. 

16. A host cell comprising a heterologous DNA molecule 
5 according to claim 9. 

1 7. The host cell according to claim 1 6, wherein said 
heterologous DNA molecule has the nucleotide sequence of SEQ. ID. No. 4. 

1 8. The host cell according to claim 1 6, wherein said 
heterologous DNA molecule is in a recombinant DNA expression system. 

10 19. The host cell according to claim 1 6, wherein said host cell 

is a yeast cell. 

20. The host cell according to claim 1 9, wherein the yeast cell 
is of a strain selected from the group consisting of Saccharomyces, 
Kluyveromyces, Torulaspora, and Schizosaccharomyces. 

15 21. The host cell according to claim 1 9, wherein the yeast cell 

is a methylotrophic yeast strain. 

22. The host cell according to claim 2 1 , wherein the 
methylotrophic yeast strain is selected from the group consisting of Pichia, 
Hansenula, Torulupsis, Candida, and Karwinskia. 

20 23. A method of recombinantly producing a mutant acid 

phosphatase/phytase comprising: 

transforming a host cell with at least one heterologous DNA 
molecule according to claim 9 under conditions suitable for expression of the 
mutant acid phosphatase/phytase and 

25 isolating the mutant acid phosphatase/phytase. 
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24. The method according to claim 23, wherein the host cell is 

a yeast cell. 

25. The method according to claim 24, wherein the yeast cell is 
of a strain selected from the group consisting of Saccharomyces, Kluyveromyces, 

5 Torulaspora, and Schizosaccharomyces. 

26. The method according to claim 24, wherein the yeast cell is 
a methylotrophic yeast strain. 

27. The host cell according to claim 26, wherein the 
methylotrophic yeast strain is selected from the group consisting of Pichia, 

10 Hansenula, Torulupsis, Candida, and Karwinskia. 

28. An animal feed composition comprising the isolated mutant 
acid phosphatase/phytase according to claim 1 . 

* 

29. A method for producing animal feed comprising: 

introducing the isolated mutant acid phosphatase/phytase 
15 according to claim 1 into animal feed under conditions effective to produce an 
animal feed composition. 

30. An isolated mutant acid phosphatase/phytase which differs 
from a wild-type Escherichia coli acid phosphatase/phytase having an amino acid 
sequence of SEQ. ID. No. 1 by at least one amino acid substitution which disrupts 

20 disulfide bond formation between Cys amino acid residues at positions 200 and 
210. 

3 1 . The isolated mutant acid phosphatase/phytase according to 
claim 30, wherein the isolated mutant acid phosphatase/phytase is in pure form. 

32. The isolated mutant acid phosphatase/phytase according to 
25 claim 30, wherein the isolated mutant acid phosphatase/phytase is recombinant. 
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33. A method for improving enzymatic properties of a wild- 
type Escherichia coli acid phosphatase/phytase having an amino acid sequence of 
SEQ. ID. No. 1, said method comprising: 

altering the amino acid sequence of said wild-type acid 
5 phosphatase/phytase by introducing at least one amino acid substitution which 
disrupts disulfide bond formation between Cys amino acid residues at positions 
200 and 210. 

34. The method according to claim 33, wherein the improved 
enzymatic property is enhanced thermostability. 

1 0 35. The method of claim 33, wherein the improved enzymatic 

property is greater phytase activity at a pH range of between about pH 3.5 to about 
pH 5.5. 

36. An isolated DNA molecule encoding a mutant acid 
phosphatase/phytase according to claim 30. 

15 37. A recombinant DNA expression system comprising a DNA 

molecule according to claim 36. 

38. The expression system according to claim 37, wherein the 
DNA molecule is in a heterologous expression vector. 

39. The expression system according to claim 3 7, wherein the 
20 DNA molecule is inserted into the expression system in proper orientation and 

correct reading frame. 

40. A host cell comprising a heterologous DNA molecule 
according to claim 36. 

41 . The host cell according to claim 40, wherein said 
25 heterologous DNA molecule is in a recombinant DNA expression system. 

42. The host cell according to claim 40, wherein said host cell 

is a yeast cell. 
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43. The host cell according to claim 42, wherein the yeast cell 
is of a strain selected from the group consisting of Saccharomyces , 
Kluyveromyces, Torulaspora, and Schizosaccharomyces. 

44. The host cell according to claim 42, wherein the yeast cell 
5 is a methylotrophic yeast strain. 

45. The host cell according to claim 44, wherein the 
methylotrophic yeast strain is selected from the group consisting of Pichia, 
Hansenula, Torulupsis, Candida, and Karwinskia. 

46. A method of recombinantly producing a mutant acid 
1 0 phosphatase/phytase comprising: 

transforming a host cell with at least one/heterologous DNA 
molecule according to claim 36 under conditions suitable for expression of the 
mutant acid phosphatase/phytase and 



a yeast cell. 

48. The method according to claim 47, wherein the yeast cell is 
of a strain selected from the group consisting of Saccharomyces, Kluyveromyces, 
Torulaspora, and Schizosaccharomyces, 



isolating the mutant acid phosphatase/phytase. 



15 



47. The method according to claim 46, wherein the host cell is 



20 



49. The method according to claim 47, wherein the yeast cell is 



a methylotrophic yeast strain. 



50. The host cell according to claim 49, wherein the 
methylotrophic yeast strain is selected from the group consisting of Pichia, 
Hansenula, Torulupsis, Candida, and Karwinskia. 



25 5 1 . An animal feed composition comprising the isolated mutant 

acid phosphatase/phytase according to claim 30. 
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52. A method for producing animal feed comprising: 

introducing the isolated mutant acid phosphatase/phytase 
according to claim 30 into animal feed under conditions effective to produce an 
animal feed composition. 
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SEQUENCE LISTING 
<110> Cornell Research Foundation, Inc. 

<120> SITE -DIRECTED MUTAGENESIS OF ESCHERICHIA COLI PHYTASE 

<130> 19603/4032 

<140> 
<141> 

<150> 60/166,179 
<151> 1999-11-18 

<160> 9 / 

<170> Patentln Ver. 2.1 

<210> 1 
<211> 432 
<212> PRT 

<213> Escherichia coli 
<400> 1 

Met Lys Ala He Leu He Pro Phe Leu Ser Leu Leu He Pro Leu Thr 
15 10 15 

Pro Gin Ser Ala Phe Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser 

20 25 30 

Val Val He Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Thr 
35 40 45 

Gin Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val 
50 55 60 

Lys Leu Gly Trp Leu Thr Pro Arg Gly Gly Glu Leu He Ala Tyr Leu 
65 70 75 80 

Gly His Tyr Gin Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Ala Lys 

85 90 95 

Lys Gly Cys Pro Gin Pro Gly Gin Val Ala He He Ala Asp Val Asp 

100 105 110 

Glu Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro 
115 120 125 
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Asp Cys Ala lie Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp 
130 135 140 

Pro Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala 
145 150 155 160 

Asn Val Thr Asp Ala lie Leu Ser Arg Ala Gly Gly Ser He Ala Asp 

165 170 175 

Phe Thr Gly His Arg Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu 

180 185 190 

Asn Phe Pro Gin Ser Asn Leu Cys Leu Lys Arg Glu Lys Gin Asp Glu 
195 200 205 

Ser Cys Ser Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala 
210 215 220 

Asp Asn Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr 
225 230 235 240 

Glu He Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp 

245 250 255 

Gly Arg He Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His 

260 265 270 

Asn Ala Gin Phe Tyr Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser 
275 280 285 

Arg Ala Thr Pro Leu Leu Asp Leu He Lys Thr Ala Leu Thr Pro His 
290 295 300 

Pro Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu 
305 310 315 320 

Phe He Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu 

325 330 335 

Glu Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly 

340 345 350 

Gly Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin 
355 360 365 

Trp lie Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp 
370 375 380 
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Lys Thr Pro Leu. 
385 

Leu Ala Gly Cys 
Gly Phe Thr Gin 

420 



Ser Leu Asn Thr 
390 

Glu Glu Arg Asn 
405 

lie Val Asn Glu 



Pro Pro Gly Glu 
395 

Ala Gin Gly Met 
410 

Ala Arg lie Pro 
425 



Val Lys Leu Trir 

4O0 

Cys Ser Leu Ala 
415 

Ala Cys Ser Leu 
430 



<210> 2 
<211> 1486 
<212> DNA 

<213> Escherichia coli 
<400> 2 

taaggagcag aaacaatgtg gtatttactt tggttcgtcg gcattttgtt gatgtgttcg 60 
ctctccaccc ttgtgttggt atggctggac ccgcgattga aaagttaacg aacgtaggcc 120 
tgatgcggcg cattagcatc gcatcaggca atcaataatg tcagatatga aaagcggaaa 180 
catatcgatg aaagcgatct taatcccatt tttatctctt ctgattccgt taaccccgca 240 
atctgcattc gctcagagtg agccggagct gaagctggaa agtgtggtga ttgtcagccg 300 
tcatggtgtg cgtgccccaa ccaaggccac gcaactgatg caggatgtca ccccagacgc 360 
atggccaacc tggccggtaa aactgggttg gctgacacca cgcggtggtg agctaatcgc 420 
ctatctcgga cattaccaac gccagcgtct ggtggccgac ggattgctgg cgaaaaaggg 480 
ctgcccgcag cctggtcagg tcgcgattat tgtcgatgtc gacgagcgta cccgtaaaac 540 
aggcgaagcc ttcgccgccg ggctggcacc tgactgtgca ataaccgtac atacccaggc 600 
agatacgtcc agtcccgatc cgttatttat tcctctaaaa actggcgttt gccaactgga 660 
taacgcgaac gtgactgacg cgatcctcag cagggcagga gggtcaattg ctgactttac 720 
cgggcatcgg caaacggcgt ttcgcgaact ggaacgggtg cttaattttc cgcaatcaaa 780 
cttgtgcctt aaacgtgaga aacaggacga aagctgttca ttaacgcagg cattaccatc 840 
ggaactcaag gtgagcgccg acaatgtttc attaaccggt gcggtaagcc tcgcatcaat 900 
gctgacggaa atatttctcc tgcaacaagc acagggaatg ccggagccgg ggtggggaag 960 
gatcactgat tcacaccagt ggaacacctt gctaagtttg cataacgcgc aattttattt 1020 
actacaacgc acgccagagg ttgcccgcag tcgcgccacc ccgttattgg atttgatcaa 1080 
gacagcgttg acgccccatc caccgcaaaa acaggcgtat ggtgtgacat tacccacttc 1140 
agtgctgttt attgccggac acgatactaa tctggcaaat ctcggcggcg cactggagct 1200 
caactggacg cttccaggtc agccggataa cacgccgcca ggtggtgaac tggtgtttga 1260 
acgctggcgt cggctaagcg ataacagcca gtggattcag gtttcgctgg tcttccagac 1320 
tttacagcag atgcgtgata aaacgccgct atcattaaat acgccgcccg gagaggtgaa 1380 
actgaccctg gcaggatgtg aagagcgaaa tgcgcagggc atgtgttcgt tggccggttt 1440 
tacgcaaatc gtgaatgaag cgcgcatacc ggcgtgcagt ttgtaa 1486 



<210> 3 
<211> 432 
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<212> PRT 

<213> Escherichia coli 
<400> 3 

Met Lys Ala lie Leu lie Pro Phe Leu Ser Leu Leu lie Pro Leu Trir 
15 io is 

Pro Gin Ser Ala Phe Ala Gin Ser Glu Pro Glu Leu Lys Leu Glu Ser 

20 25 30 

Val Val lie Val Ser Arg His Gly Val Arg Ala Pro Thr Lys Ala Trir 
35 40 45 

Gin Leu Met Gin Asp Val Thr Pro Asp Ala Trp Pro Thr Trp Pro Val 
50 55 60 

Lys Leu Gly Trp Leu Thr Pro Arg Gly Gly Glu Leu lie Ala Tyr Leu 
65 70 75 80 

Gly His Tyr Gin Arg Gin Arg Leu Val Ala Asp Gly Leu Leu Ala Lys 

85 90 95 

Lys Gly Cys Pro Gin Pro Gly Gin Val Ala lie lie Ala Asp Val Asp 

100 105 no 

Glu Arg Thr Arg Lys Thr Gly Glu Ala Phe Ala Ala Gly Leu Ala Pro 
115 120 125 

Asp Cys Ala lie Thr Val His Thr Gin Ala Asp Thr Ser Ser Pro Asp 
130 135 140 

Pro Leu Phe Asn Pro Leu Lys Thr Gly Val Cys Gin Leu Asp Asn Ala 
145 150 155 160 

Asn Val Thr Asp Ala lie Leu Ser Arg Ala Gly Gly Ser lie Ala Asp 

165 170 175 

Phe Thr Gly His Arg Gin Thr Ala Phe Arg Glu Leu Glu Arg Val Leu 

180 185 190 

Asn Phe Pro Gin Ser Asn Leu Asn Leu Lys Arg Glu Lys Gin Asn Glu 
195 200 205 

Ser Cys Asn Leu Thr Gin Ala Leu Pro Ser Glu Leu Lys Val Ser Ala 
210 215 220 

Asp Asn Val Ser Leu Thr Gly Ala Val Ser Leu Ala Ser Met Leu Thr 
225 230 235 240 
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Glu lie Phe Leu Leu Gin Gin Ala Gin Gly Met Pro Glu Pro Gly Trp 

245 250 255 

Gly Arg lie Thr Asp Ser His Gin Trp Asn Thr Leu Leu Ser Leu His 

260 265 270 

Asn Ala Gin Phe Tyr Leu Leu Gin Arg Thr Pro Glu Val Ala Arg Ser 
275 280 285 

Arg Ala Thr Pro Leu Leu Asp Leu lie Lys Thr Ala Leu Thr Pro His 
290 295 300 

Pro Pro Gin Lys Gin Ala Tyr Gly Val Thr Leu Pro Thr Ser Val Leu 
305 310 315 320 

Phe lie Ala Gly His Asp Thr Asn Leu Ala Asn Leu Gly Gly Ala Leu 

325 330 335 

Glu Leu Asn Trp Thr Leu Pro Gly Gin Pro Asp Asn Thr Pro Pro Gly 

340 345 350 

Gly Glu Leu Val Phe Glu Arg Trp Arg Arg Leu Ser Asp Asn Ser Gin 
355 360 365 

Trp lie Gin Val Ser Leu Val Phe Gin Thr Leu Gin Gin Met Arg Asp 
370 375 380 

Lys Thr Pro Leu Ser Leu Asn Thr Pro Pro Gly Glu Val Lys Leu Thr 
385 390 395 400 

Leu Ala Gly Cys Glu Glu Arg Asn Ala Gin Gly Met Cys Ser Leu Ala 

405 410 415 

Gly Phe Thr Gin He Val Asn Glu Ala Arg He Pro Ala Cys Ser Leu 

420 425 430 



<210> 4 
<211> 1486 
<212> DNA 

<213> Escherichia coli 
<400> 4 

taaggagcag aaacaatgtg gtatttactt tggttcgtcg gcattttgtt gatgtgttcg 60 
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ctctccaccc ttgtgttggt atggctggac 
tgatgcggcg cattagcatc gcatcaggca 
catatcgatg aaagcgatct taatcccatt 
atctgcattc gcfccagagtg agccggagct 
tcatggtgtg cgtgccccaa ccaaggccac 
atggccaacc tggccggtaa aactgggttg 
ctatctcgga cattaccaac gccagcgtct 
ctgcccgcag cctggtcagg tcgcgattat 
aggcgaagcc ttcgccgccg ggctggcacc 
agatacgtcc agtcccgatc cgttatttat 
taacgcgaac gtgactgacg cgatcctcag 
cgggcatcgg caaacggcgt ttcgcgaact 
cttgaacctt aaacgtgaga aacagaatga 
ggaactcaag gtgagcgccg acaatgtttc 
gctgacggaa atatttctcc tgcaacaagc 
gatcactgat tcacaccagt ggaacacctt 
actacaacgc acgccagagg ttgcccgcag 
gacagcgttg acgccccatc caccgcaaaa 
agtgctgttt attgccggac acgatactaa 
caactggacg cttccaggtc agccggataa 
acgctggcgt cggctaagcg ataacagcca 
tttacagcag atgcgtgata aaacgccgct 
actgaccctg gcaggatgtg aagagcgaaa 
tacgcaaatc gtgaatgaag cgcgcatacc 



ccgcgattga aaagttaacg aacgtaggcc 120 
atcaataatg tcagatatga aaagcggaaa 180 
tttatctctt ctgattccgt taaccccgca 240 
gaagctggaa agtgtggtga ttgtcagccg 300 
gcaactgatg caggatgtca ccccagacgc 360 
gctgacacca cgcggtggtg agctaatcgc 420 
ggtggccgac ggattgctgg cgaaaaaggg 480 
tgtcgatgtc gacgagcgta cccgtaaaac 540 
tgactgtgca ataaccgtac atacccaggc 600 
tcctctaaaa actggcgttt gccaactgga 660 
cagggcagga gggtcaattg ctgacfcttac 720 
ggaacgggtg cttaattttc cgcaatcaaa 780 
aagctgtaac ttaacgcagg cattaccatc 840 
attaaccggt gcggtaagcc tcgcatcaat 900 
acagggaatg ccggagccgg ggtggggaag 960 
gctaagtttg cataacgcgc aattttattt 1020 
tcgcgccacc ccgttattgg atttgatcaa 1080 
acaggcgtat ggtgtgacat tacccacttc 1140 
tctggcaaat ctcggcggcg cactggagct 12 00 
cacgccgcca ggtggtgaac tggtgfcttga 1260 
gtggattcag gtttcgctgg tcttccagac 132 0 
atcattaaat acgccgcccg gagaggtgaa 1380 
tgcgcagggc atgtgttcgt tggccggttt 1440 
ggcgtgcagt ttgtaa i486 



<210> 5 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 

<400> 5 

ggaattcgct cagagccgga 20 



<210> 6 

<211> 31 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: 
Oligonucleotide 



6 
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<400> 6 

ctgggtatgg ttggttatat tacagtcagg t 



<210> 7 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
Oligonucleotide 

<400> 7 

caaacttgaa ccttaaacgt gag 

<210> 8 
<211> 31 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
Oligonucleotide 

<400> 8 

cctgcgttaa gttacagctt tcattctgtt t 



<210> 9 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence 
Oligonucleotide 

<400> 9 

ggggtacctt acaaactgca eg 
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